|
|
Accession Number |
TCMCG021C34643 |
gbkey |
CDS |
Protein Id |
XP_019710706.1 |
Location |
complement(join(21974526..21974592,21975619..21975929,21979657..21980763,21984152..21984598,21985018..21985098,21985185..21985340,21991342..21991446,21991592..21992198,21992450..21992631,21992884..21993064,21993303..21993406,21994225..21994353,21994447..21994543,21994941..21995068,21999717..21999776,21999878..21999997,22000075..22000155,22000238..22000423,22000739..22000937,22001958..22002038,22005890..22006080,22006385..22006460,22015822..22015955,22016039..22016107,22016398..22016445,22016532..22016606,22018426..22018506,22019571..22019633,22019733..22019792,22021983..>22022024)) |
Gene |
LOC105058689 |
GeneID |
105058689 |
Organism |
Elaeis guineensis |
|
|
Length |
1842aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA268357 |
db_source |
XM_019855147.1
|
Definition |
THO complex subunit 2 isoform X1 [Elaeis guineensis] |
CDS: CTTACGATGCCTGGAGATTGTCGTGCTCGTCTCATCAAAATGGCAAAATGGCTTGTCGAGTCTTCGTTGGTTCCATCTAGGCTTTTGCAAGAGAGTTGTGAGGAAGAATTTCTGTGGGAGTCTGAATTGAATAAAGTAAAGGCTCAAGATTTGAAGGCCAAAGAGGTTAGAGTAAACACCCGCCTTCTTTATCAGCAAACAAAATTCAATCTTCTACGAGAGGAGAGCGAGGGCTATGCCAAACTGGTGACGCTTCTTTGTCAGGGTGGTTTAGATTTGACAACTGAGAATACATCAACAGTGACAATTAGCATAATTAAGTCATTAATTGGGCACTTTGATCTGGACCCTAATCGTGTTTTTGATGTTGTGTTGGAGTGTTTTGAACTTTATCCTGAGAATGCTGCTTTTTATAATCTCATTCCTATATTTCCAAAGTCACATGCTGCTCAGATTTTGGGGTTTAAGTTCCAGTATTATCAACGTATGGATGTGAACACACGTGCTCCTTCCAGTCTTTATCAGCTTACAGCTTTGCTGGTGAAAGCAAACTTTATTGATCTTGATAATATATATGCACACTTACTTCCAAAGGATGATGATGCATTTGAGCACTATGATGCATTTACTGCAAGACGGTTTGATGAGGTCAACAAAATTGGCAGAATTAATCTTGCTGCTACAGGAAAAGACCTTATGGACGATGAGAAACAAGATGTGACTATTGATCTGTTTTCTGCTTTGGACATGGAAAATGATGCTATTACAGAACAAGCACCTGAGGTTGAAAATAATCAGAAACTTGGTTTGCTTATTGGTTTTATTTTTGTTGATGACTGGTACCATGCTCAGATACTATTTGATCGTCTGTCCCATCTAGATCCCGTTCAGCATATCCAAATTTGCGAGGGCCTATTTAGGGTCATTGAGAAGACTATGTCTGCAGCCTATGCTATTGTCTATCAAACACATCTTCAAAGTCGTGCTGGTTCCAATGTTGTGGAATCGACAGCTGGATCTTCTATCCAGAATTCTTCTATTGATCTCCCTCGTGAATTTTTTCAGATGCTTGCTGCTGCAGGACCATATCTTCATCGTGATGCTGTACTGCTTCAGAAGGTGTGCAGAGTGTTGAGAGCATACTACCTCTGTGCTGAAGAATTAGCTGGCCTTCGAGCTAAAGAAGCTAAGCTTAGGGTTGAAGAAGCACTTGGAAAATGTGTGCTTCCTTCATTGCAATTAATACCTGCAAATCCTGCAGTTGGGCAAGTGATATGGGAACTCCTTTCTCTGCTCCCCTATGAGGATCGATATCGCCTGTATGGTGAATGGGAAAAGGATGATGAAAGAATCCCAATGGTTCTGGCTGCAAGGCAGATTGCAAAGTTGGACACTAGAAGAATACTGAAAAGGCTTGCAAAGGAAAATTTGAAGCAGCTGGGTCGCATGGTGGCCAAACTTGCTCATTCTAATCCGATGACTGTGCTTCGCACAATTGTTCACCAGATTGAGGCATACAGGGATATGATAACACCAGTTGTAGATGCCTTCAAGTACTTGAGACAGCTGGAATATGATGTGTTAGAATATGTTGTAATCGAACGTCTAACACAAGGAGGACGTGAGAAGCTTAAAGATGATGGCCTGAATTTGTCAGATTGGCTTCAGTCTCTTGCATCCTTTTGGGGCCATCTGTGTAAGAGGTACCCTTCAATGGAGTTGAGAGGCCTTTTTCAGTATCTTGTTAATCAATTGAAGAAGGGCTCAGGAATTGAGCTTGTTCTGTTGCAGGAGCTTATTCAGCAGATGGCCAATGTTCAATACACTGAGAACATGACTGAGGAGCAACTTGATGCTATGGCAGGAGGTGAAACATTGAGATATCAAGCTACTCTATTTGGAATGACTATAAACAATAAGGCATTGACTAAATCTACCAACAGACTTAGGGACTCCTTACTTCCGAAGGAAGAGCCTAAGCTGGCTATTCCTCTTTTGTTACTAATAGCTCAACATCGCTCCATGGTTATCATAAATGCGGATGCATTATACATCAAAATGGTTAGCGAGCAGTTTGACAGGTGCCATGGCATGCTTCTTCAATATGTTGAGTTTCTGTTGAGTGCCATAACTCCATCTATGATCTATGCTCAGCTGATTCCTCCTCTAGATGATCTTGTTCACAAGTACCATCTTGATCCAGAGGTAGCATTTCTGGTATATCGCCCTGTGATGAGGCTCTTCAAAAGTATAAGTGGAGCTGAAATATGCTGGCCTCTTGACATAACTGAAGAGCCCAATGTTTCAAGCACAAATGAAGAAGCAGAGCCTTCATATATATCCTGTGATGTTGTTTTGGATCTTGGATCACCCTGGAGGCCTGTCAATTGGTCAGACCTTCTTGACACAGTCCGGTCAATGCTGCCTCAGAAAGCTTGGAATAGCCTCTCTCCTGATCTTTATGTTACATTTTGGGGGCTTACACTCTACGATCTTTATGTTCCTCGACACCGTTATGAATCAGAGATCACAAAGCAGCATGCTGCTATTAAAGCCTTGGAAGAACTTTCTGACACCTCCAATATGGCTATCACAAAGCGGAAAAAAGACAAGGAAAGGATCCAAGAGCTACTTGACAGATTGAGTTGTGAATTTCAAAAGCATGAACAACATGTTGCATCTGTGCGCCAAAGGCTTAGTCATGAGAAGGACAAATGGCTGAGTTCCTGCCTGGATACTCTAAAGATAAACATGGAGTTTCTTCAACGATGCATCTTCCCACGCTGCATCTTCAGCATGCCAGATGCTGTGTATTGCGCTATGTTTGTGCATACGCTACATTCACTTGGCACACCATTTTTTAACACGGTCAACCATATTGATGTTCTTATATGTAAAACCCTACAGCCGATGATCTGTTGCTGCACCGAATTTGAAGCTGGCAGACTTGGAAGGTTTTTATATGAGACACTAAAGATGGCTTACCATTGGAAGAGTGATGAGTCCATATATGAACATGAATGTGGAAACATGCCAGGGTTTGCTGTCTATTACAGATTCCCAAACAGTCAGCGTGTAACTTATAGCCAATTTATTAGAGTACACTGGAAATGGAGTGGAAGAATAACCAGATTGCTTGTGCAGTGCTTGGAATCTACTGAGTACATGGAAATACGAAATGCTCTTATTGTGTTGACAAAAATTTCTAGTGTTTTCCCTGTTACTCGGAAGAGTGGTATTAATCTTGAAAAGCGGGTAGCTAAAATTAAAGGGGATGAGAGAGAAGATTTGAAAGTTTTGGCTACTGGTGTAGCTGCCGCTTTGGCTGCACGCAAGGGTTCATGGGTTTCTGAGGAAGAATTTGGTATGGGTCATATTGATCTAAAGCATGCAGCAGCATCAACAAAATCACCTGCTGGTAACCTGGGCAATGCACCAAATGGTTCTGCTCTTGGTATATCTCAGAATGAGATGTCTGGGACAAGGAATGCCACTACGGGGAATCAGGTAGCGGATCCATTGGATATAATTAAAGATCGGATGACACGTGCAAAATCTACAGATGGCAGGTCGGATCGATCAGAAGATGGAGTGCTTTTGAAAGCTGATTCAGCACAACAAAAATCAAGGAGTAGCTCTTCCATGAATGGGCCTGATAGTCAAACACATGCTTCTTTGCTGCCTAAGCCTTCTGGGATCATGAAAAATTTAGATGAACTTCTAAAAGTTTCACCGGAGGAAACATCTACAAAAGTTGCTTCAAAGGGCACTGTGGAATCTGAGACCAGACCACTGCAAAAACGTTCAGCACAGAATTCTCTTGGTAGGCTACCAAAACAAGAGTTGGTCAAAGGAGATGCTAAATCTAGAAAATCAATCAGCAGAACTGCCTATCAGCAATTTTCTGCAATGGCTGACAGGGATCTTTCAGCTCATCAATCAGAGAGTAGGCAAGGTGATACTGCTATGAATTCTTCATCCACTTCCTGTGGTAACTTAAATTCATCAGGAAAAGTAGCAAGTTCCTCCTCAAAAATGAATGATGTGCATGTTAGTGTATCCAAGATGGACAGTGGACCTCTCAAACCCTTGGATGACACTGTAGAAGCGCCTGATGCTTTCCCTAAAGAGCAAAAGAGATTTGCTTCAGCTGAAGAACGAGATAGATCGAACAAACAAAGGAAAGGAGACATGGACGGAAAGGATGGTGAGGCTATGGAAGTTCGATTATCTGATAAAAATAGAATTTTTGATGCCAGATCAATGGATAAATCTCACTTTTCAGATCATGAGAGGCCTAAAATTGAAGAACAAAGTCCCATTAGGCCTGTGGATAAGCATTCTGATAAATCTAGAGATAAAACTATTGAAAGATATGATAAAGACCACAGGGAAAACTTGGATCGGCCTGATAAGAGCCATGGTGTGGACATTCTTGAGAAATCAAGAGATAGGTCAATTGAAAGACATGGAAGAGAACGTTCTGTTGAAAGAGTGCAGGAGAGAGCAGCAGATAGGAATATAGATAGGTCTGTTGATAAATCTAGAGATGACAGAAGCAAAGATGATAGGAATAAATCGCGACACAATGAGGCTCCCATGGATAAGGTGCATTCCGATGAGCGTTTTCATGGACAAGGTTTGCCGCTGCCACCTCCACTACCTCCAAGTTTTGTTCCCCAATCTGTCGGTGGTAGTCAAAGAGATGAAGACCCTGAAAGAAGGGTCGGTAACACTAGACACACACAGCATCTGTTGCCTAGGCATGATGAAAAAGAATGTAGGCGCTCAGAGGAGAATGTTTTAGCATCACAGAATGATGCAAAACATAGAAGAGATGATGAGTTTCGAGAAAACAAGTGGGAGGAACAAGGAGATGTGCCAAATAAGGTAGAAGAGAGGGACAGAGAGAAGGGGAATGTACTGAAGGATGATACGGACCCTACTGCAGCCCCCAAGCGGCGAAAGCTTAAAAGGGACCATACATCCTCTTCTGAAGCTGGTGGGAAGTATTTACCATTTGTTCCGGGACCGCCGCCACCACCAAGACTAGCATTGGGGATATCTCAATCGTTTGATGCAAGAGAAAGGGGAGATAGAAAAGGGATTATGGTGCAGCATCGAGCTGTTTACATGGATGAAGTTCCAAGGGTGCATGGCAAAGAAGCCGCAAGCAAGATCAATCGTCGGGAGACTGATCAGATATATGAAAGAGAGTGGGAAGAAGAGAAGCGAAGGACTGAAGCTAAGAGGAAGCATCGGAAGTAG |
Protein: MSVQSPEFKYITEGCLQEWKASNAAFKLPDPVPMNRFLYELCWAMVRGDLPFQKCSVALGSVVFVEEQQRVEMASIIADIIAHMGQDLTMPGDCRARLIKMAKWLVESSLVPSRLLQESCEEEFLWESELNKVKAQDLKAKEVRVNTRLLYQQTKFNLLREESEGYAKLVTLLCQGGLDLTTENTSTVTISIIKSLIGHFDLDPNRVFDVVLECFELYPENAAFYNLIPIFPKSHAAQILGFKFQYYQRMDVNTRAPSSLYQLTALLVKANFIDLDNIYAHLLPKDDDAFEHYDAFTARRFDEVNKIGRINLAATGKDLMDDEKQDVTIDLFSALDMENDAITEQAPEVENNQKLGLLIGFIFVDDWYHAQILFDRLSHLDPVQHIQICEGLFRVIEKTMSAAYAIVYQTHLQSRAGSNVVESTAGSSIQNSSIDLPREFFQMLAAAGPYLHRDAVLLQKVCRVLRAYYLCAEELAGLRAKEAKLRVEEALGKCVLPSLQLIPANPAVGQVIWELLSLLPYEDRYRLYGEWEKDDERIPMVLAARQIAKLDTRRILKRLAKENLKQLGRMVAKLAHSNPMTVLRTIVHQIEAYRDMITPVVDAFKYLRQLEYDVLEYVVIERLTQGGREKLKDDGLNLSDWLQSLASFWGHLCKRYPSMELRGLFQYLVNQLKKGSGIELVLLQELIQQMANVQYTENMTEEQLDAMAGGETLRYQATLFGMTINNKALTKSTNRLRDSLLPKEEPKLAIPLLLLIAQHRSMVIINADALYIKMVSEQFDRCHGMLLQYVEFLLSAITPSMIYAQLIPPLDDLVHKYHLDPEVAFLVYRPVMRLFKSISGAEICWPLDITEEPNVSSTNEEAEPSYISCDVVLDLGSPWRPVNWSDLLDTVRSMLPQKAWNSLSPDLYVTFWGLTLYDLYVPRHRYESEITKQHAAIKALEELSDTSNMAITKRKKDKERIQELLDRLSCEFQKHEQHVASVRQRLSHEKDKWLSSCLDTLKINMEFLQRCIFPRCIFSMPDAVYCAMFVHTLHSLGTPFFNTVNHIDVLICKTLQPMICCCTEFEAGRLGRFLYETLKMAYHWKSDESIYEHECGNMPGFAVYYRFPNSQRVTYSQFIRVHWKWSGRITRLLVQCLESTEYMEIRNALIVLTKISSVFPVTRKSGINLEKRVAKIKGDEREDLKVLATGVAAALAARKGSWVSEEEFGMGHIDLKHAAASTKSPAGNLGNAPNGSALGISQNEMSGTRNATTGNQVADPLDIIKDRMTRAKSTDGRSDRSEDGVLLKADSAQQKSRSSSSMNGPDSQTHASLLPKPSGIMKNLDELLKVSPEETSTKVASKGTVESETRPLQKRSAQNSLGRLPKQELVKGDAKSRKSISRTAYQQFSAMADRDLSAHQSESRQGDTAMNSSSTSCGNLNSSGKVASSSSKMNDVHVSVSKMDSGPLKPLDDTVEAPDAFPKEQKRFASAEERDRSNKQRKGDMDGKDGEAMEVRLSDKNRIFDARSMDKSHFSDHERPKIEEQSPIRPVDKHSDKSRDKTIERYDKDHRENLDRPDKSHGVDILEKSRDRSIERHGRERSVERVQERAADRNIDRSVDKSRDDRSKDDRNKSRHNEAPMDKVHSDERFHGQGLPLPPPLPPSFVPQSVGGSQRDEDPERRVGNTRHTQHLLPRHDEKECRRSEENVLASQNDAKHRRDDEFRENKWEEQGDVPNKVEERDREKGNVLKDDTDPTAAPKRRKLKRDHTSSSEAGGKYLPFVPGPPPPPRLALGISQSFDARERGDRKGIMVQHRAVYMDEVPRVHGKEAASKINRRETDQIYEREWEEEKRRTEAKRKHRK |